MLLNVLRICI IVCLVNDGAG KHSEGRERTK TYSLNSRGYF 40 

RKERGARRSK ILLVNTKGLD EPHIGHGDFG LVAELFDSTR 80 

THTNRKEPDM NKVKLFSTVA HGNKSARRKA YNGSRRNIFS 120 

RRSFDKRNTE VTEKPGAKMF WNNFLVKMNG APQNTSHGSK 160 

AQEIMKEACK TLPFTQNIVH ENCDRMVIQN NLCFGKCISL 200 

HVPNQQDRRN TCSHCLPSKF TLNHLTLNCT GSKNWKWM 240 

MVEECTCEAH KSNFHQTAQF NMDTSTTLHH 270 



Figure 1. Deduced amino acid sequence of Xenopus cerberus protein. 



Figure 2. Nucleotide sequence of the full-length cerberus DNA derived from the Xenopus 
organizer. The sense strand is on top (in the 5' to 3' direction) and the antisense strand on 
the bottom line (in the opposite direction). SEQ ID NO:2. 
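
The two-line layout described in this caption can be checked mechanically: each bottom-line block should be the base-by-base complement of the block printed above it. The following is a minimal sketch, assuming plain A/C/G/T input. The helper name `complement` is illustrative and not part of the source; the test blocks are the first two ten-base blocks of the Figure 2 listing.

```python
# Illustrative helper (hypothetical name): base-by-base complement of a DNA strand.
# The bottom line is printed in the opposite (3' to 5') direction, so no
# reversal is needed to compare it with the top line.
_PAIRS = str.maketrans("ACGT", "TGCA")

def complement(top_strand: str) -> str:
    """Return the antisense line as it would be printed under the sense line."""
    return top_strand.replace(" ", "").upper().translate(_PAIRS)

top = "GAATTCCCAG CAAGTCGCTC"      # first two sense-strand blocks of Figure 2
bottom = "CTTAAGGGTC GTTCAGCGAG"   # the blocks printed beneath them
print(complement(top) == bottom.replace(" ", ""))
```

A mismatch between a top block and the complement of the block beneath it flags a likely transcription error in one of the two lines.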



GAATTCCCAG CAAGTCGCTC AGAAACACTG CAGGGTCTAG ATATCATACA ATGTTACTAA 
CTTAAGGGTC GTTCAGCGAG TCTTTGTGAC GTCCCAGATC TATAGTATGT TACAATGATT 

ATGTACTCAG GATCTGTATT ATCGTCTGCC TTGTGAATGA TGGAGCAGGA AAACACTCAG 
CTAGACATAA TAGCAGACGG AACACTTACT ACCTCGTCCT TTTGTGAGTC 

AAGGACGAGA AAGGACAAAA ACATATTCAC TTAACAGCAG AGGTTACTTC AGAAAAGAAA 
TTCCTGCTCT TTCCTGTTTT TGTATAAGTG AATTGTCGTC TCCAATGAAG TCTTTTCTTT 

™r^r_rarr. TAGGAGCAAG ATTCTGCTGG TGAATACTAA AGGTCTTGAT GAACCCCACA 
SSS£ TAAGACGACC ACTTATGATT TCCAGAACTA CTTGGGGTGT 

TTGGGCATGG TGATTTTCGC TTAGTAGCTG AACTATTTGA TTCCACCAGA ACACATACAA 
AACCCGTACC ACTAAAAGCG AATCATCGAC TTGATAAACT AAGGTGGTCT TGTGTATGTT 

ceCAGACATG AACAAAGTCA AGCTTTTCTC AACAGTTGCC CATGGAAACA 

SSSSI? £S?£agt tcgaaaagag ttgtcaacgg GTACCTXTGT 
«»rT«-"G aagaaaagct tacaatggtt ctagaaggaa tatttttcct cgccgitctt 

£S££c ATGTTACCAA GATCTTCCTT ATAAAAAGGA GCGGCAAGAA 

TTGATAAAAG aaatacagag gttactgaaa agcctggtgc caagatgttc tggaacaatt 

AACTATTTTC TTTATGTCTC CAATGACTTT TCGGACCACG GTTCTACAAG ACCTTGTTAA 

b TTTTGGTTAA AATGAATGGA GCCCCACAGA ATACAAGCCA TGGCAGTAAA G^CAGGAAA 
H AAAACCAATT TTACTTACCT CGGGGTGTCT TATGTTCGGT ACCGTCATTT CGTGTCCTTT 

K TAATGAAAGA AGCTTGCAAA ACCTTGTTTT TCACTCAGAA TATTGTACAI GAAAACTGTG 
n ATTACTTTCT TCGAACGTTT TGGAACAAAA AGTGAGTCTT ATAACATGTA CTTTTGACAC 

ACAGGATGGT GATACAGAAC AATCTGTGCT TTGGTAAATG g ggg gg g™?^ 
TGTCCTACCA CTATGTCttKi VTAGACAOjA AAlUlIIIAC GTAOAGAGAG GTAvnB i iriiTT 
ATCAGCAAGA TCGACGAAAT ACTTGTTCCC ATTGCTTGCC GTCCAAATTT ACCCTGAACC 
SSotSS SSSJa TGAACAAGGG TAACGAACGG CAGGTTTAAA TGGGACTTGG 

ACCTGACGCT GAATTGTACT GGATCTAAGA ATGTAGTAAA GGTTGTCATG ATGGTAGAGG 

iSS S?ISga cciagattct tacatcattt ccaacagtac taccatctcc 
ihkmCGTG tgaagctcat aagagcaact tccaccaaac tgcacagttt aacatggata 

SSJS ISSSI TTCTCGTTGA AGGTGGTTTG ACGTGTCAAA TTGTACCTAT 

™«mr*ic CCTGCACCAT TAAAGGACTG CCATACAGTA TGGAAATGCC CTTTTGTTGG 
SIgISg SSI ISScCTGAC GGTATGTCAT ACCTTTACGG GAAAACAACC 

avrvPTTGTT ACATACTATG CATCTAAAGC ATTATGTTGC CTTCTATTTC ATATAACCAC 
«I?IIaSI JSSgaSac GTAGATTTCG TAATACAACG GAAGATAAAG TATATTGGTG 

ATGGAATAAG GATTGTATGA ATTATAATTA ACAAATGGCA «TT^ S2S££ 
TACCTIATTC CTAACATACT TAATATTAAT TGTTTACCGT AAAACACATT GTACGTTCTA 



ass ss ssss sssss see ssisi 
ssss = = ssss SSK ssss 

AGGGACTAAG TTTGCCCAGG AGCAGTGACC CATAACAACC AATCAGCAGG TATGATTTAC 

Sto Hacgggtcc tcgtcactgg gtattgttgg ttagtcgtcc atactaaatg 

TGGTCACCTG TTTAAAAGCA AACATCTTAT TGGTTGCTAT GGGTTACTGC "CTGGGCAA 
ACCAGTGGAC AAATTTTCGT TTGTAGAATA ACCAACGATA CCCAATGACG AAGACCCGTT 

AVTGTGTGCC TCATAGGGGG GTTAGTGTGT TGTGTACTGA ATAAATTGTA »T»TTTCAT 
SISS ScCCCC CAATCACACA ACACATGACT TATTTAACAT AAATAAAGTA 



1080 
1140 
1200 
1260 
TGTTACAAAA AAAAAAAA 
ACAATGTTTT TTTTTTTT 



Fig. 2. (Continuation page 2, SEQ ID NO:2). 
MSRTRKVDSL LLLAIPGLAL LLLPNAYCAS CEPVRIPMCK SMPWNMTKMP NHLHHSTQAN 60 
AILAIEQFEG LLTTECSQDL LFFLCAMYAP ICTIDFQHEP IKPCKSVCER ARAGCEPILI 120 
KYRHTWPESL ACEELPVYDR GVCISPEAIV TVEQGTDSMP DFSMDSKNGN CGSGREHCKC 180 
KPMKATQKTY LKNNYNYVIR AKVKEVKVKC HDATAIVEVK EILKSSLVNI PKDTVTLYTN 240 
SGCLCPQLVA NEEYIIMGYE DKERTRLLLV EGSLAEKWRD RLAKKVKRHD QKLRRPRKSK 300 
DPVAPIPKKN SNSRQARS 











Figure 3. Deduced amino acid sequence of Xenopus frazzled protein. SEQ ID NO:3. 






Figure 4. Nucleotide sequence of the full-length frazzled cDNA derived from the Xenopus 
organizer. The sense strand is on top (in the 5' to 3' direction) and the antisense strand 
on the bottom line (opposite direction). SEQ ID NO:4. 



GAATTCCCTT TCACACAGGA CTCCTGGCAG AGGTGAATGG TTAGCCCTAT GGA«TGGTT 
CTTAAGGGAA AGTGTGTCCT GAGGACCGTC TCCACTTACC AATCGGGATA CCTAAACCAA 

TGTTGATTTT GACACATGAT TGATTGCTTT CAGATAGGAT TGAAGGACTT GGATTTTTAT 

ISaaaa CTGTGTACTA actaacgaaa gtctatccta acttcctgaa cctaaaaata 

CTAATTCTGC ACTTTTAAAT TATCTGAGTA ATTGTTCATT "GTATTGGA TGGGACTAAA 

Sttaagacg tgaaaattta atagactcat taacaagtaa aacataacct accctgattt 

GATAAACTTA ACTCCTTGCT TTTGACTTGC CCATAAACTA TAAGGTGGGG TGAGTTGTAG 
«2£SS TGAGGAACGA AAACTGAAOG GGTATTTGAT ATTCCACCCC ACTCAACATC 

TTGCTXTTAC ATGTGCCCAG ATTTTCCCTG ™TCCCTGT ATTCCCTCTA AAGTAAGCCT 
AACGAAAATG TACACGGGTC TAAAAGGGAC ATAAGGGACA TAAGGGAGAT TTCATTW**. 

ACACATACAG GTTGGGCAGA ATAACAATGT CTCGAACAAG GAAAGTGGAC TCA"ACTGC 
TGTGIATGTC CAACCCGTCT TATTGTTACA GAGCTTGTTC CTTTCACCTG AGIAATGACG 

TACT6GCCAT ACCTGGACTG GCGCTTCTCT TATTACCCAA TGCTTACTGT GCTTCGTGTG 
SSSSS ScTGAC CGCGAAGAGA ATAATGGGTT ACGAATGACA CGAAGCACAC 

™ AGCCTGTGCG GATCCCCATG TGCAAATCTA TGCCATGGAA CATGACCAAG ATGCC^CC 

Id iSS Saggggtac acgtttagat acggtacctt gtactggttc tacgggttgg 

ATCTCCACCA CAGCACTCAA GCCAATGCCA TCCTGGCAAT TGAACAGTTT 6AAGGTTTGC 
GTCGTGAGTT CGGTTACGGT AGGACCGTTA ACTTGTCAAA CTTCCAAACG 

TGACCACTGA ATGTAGCCAG GACCTTTTGT TCTTTCTGTG TGCCATGTAT GOCOOCATTT 
ACTGGTGACT TACATCGGTC CTGGAAAACA AGAAAGACAC ACGGTACATA CGGGGGTAAA 

GTACCATCGA TTTCCAGCAT GAACCAATTA AGCCTTGCAA GTCCGTGTGC «AAGGGCCA 
SSSS SSScS* GTTGOTfAAT TCGGAACGTT rMflCBfflflC CTTTCCOGGT 



480 
540 
600 
660 



GGGCCGGCTG TGAGCCCATT CTCATAAAGT ACCGGCACAC TTGGCCAGAG A ^<**AT 
SSScS ACTCGGGTAA GAGTATTTCA TGGCCGTGTG AACCGGTCTC TCGGACCGTA 

SSI SSSS SSSS SSSS ssss ssss 

SSSS SSSS SSSS SSSS SSSS ^LS? 

6AAGCGGCAG GGAGCACTGT AAATGCAAGC CCATGAAGGC ^CCCAAAAG AOGTATCTCJ 
CTTCGCCGTC CCTCGTGACA TTTACGTTCG GGTACTTCCG TTGGGTTTTC TGCATAWW.1 

AGAATAATTA CAATTATGTA ATCAGAGCAA AAGTGAAAGA GGTGAAAGTG ^JGCCAOG 
TCTTAITAAT GTTAATACAT TAGTCTCGTT TTCACTTTCT CCACTTTCAC TTTALMiii^ 

ACGCAACAGC AATTGTGGAA GTAAAGGAGA TTCTCAAGTC TTCCCTAGTG AACATT^TA 
TTAACACCTT CATTTCCTCT AAGAGTTCAG AAGGGATCAC TTGTAAGGAT 
840 
900 
960 
1020 



1080 



1140 



XAGACACAGT GACACTGTAC ACCAACTCAG GCTGCTTGTG CCCCCAGCTT GTTGCCAATG 

£Stca ctgtgacatg tggttgagtc cgacgaacac gggggtcgaa caacggttac 

AGGAATACAT AATTATGGGC TATGAAGACA AAGAGCGTAC CAGGCTTCTA CTAGTGGAAG 
TCCTTATGTA TTAATACCCG ATACTTCTGT TTCTCGCATG GTCCGAAGAT GATCACCTTC 

GATCCTTGGC CGAAAAATGG AGAGATCGTC TTGCTAAGAA AGTCAAGCGC TG^TCAAA 1200 
CTAGGAACCG GCTTTTTACC TCTCTAGCAG AACGATTCTT TCAGTTCGCG ACCCTAGTTT 

AGCTTCGACG TCCCAGGAAA AGCAAAGACC CCGTGGCTCC AATTCCCAAC AAAAACAGCA 1260 

tSaagctgc agggtccttt tcgtttctgg ggcaccgagg ttaagggttg tttttgtcgt 

ATTCCAGACA AGCGCGTAGT TAGACTAACG GAAAGGTGTA TGGAAACTCT ATGGACTTTG 1320 
TAAGGTCTGT TCGCGCATCA ATCTGATTGC CTTTCCACAT ACCTTTGAGA TACCTGAAAC 

AAACTAAGAT TTGCATTGTT GGAAGAGCAA AAAAGAAATT GCACTACAGC ACGTTATATT 1380 
TTTGATTCTA AACGTAACAA CCTTCTCGTT TTTTCTTTAA CGTGATGTCG TGCAATATAA 

CTATTGTTTA CTACAAGAAG CTGGTTTAGT TGATTGTAGT TCTCCTTTCC ITCTTTIITT 1440 

S?HSaat gatgttcttc gaccaaatca actaacatca AGAGGAAAGG AAGAAAAAAA 

TTATAACTAT ATTTGCACGT GTTCCCAGGC AATTGTTTTA TTCAACTTCC AGTGACAGAG 1500 

"tISI taaacgtgca caagggtccg ttaacaaaat AAGTTGAAGG TCACTGTCTC 

CAGTGACTGA ATGTCTCAGC CTAAAGAAGC TCAATTCATT TCTGATCAAC TAATGGTGAC 1560 
SSSgact TACAGAGTCG gatttcttcg AGTTAAGTAA agactagttg ATTACCACTG 

, . AAGTGTTTGA TACTTGGGGA AAGTGAACTA ATTGCAATGG TAAATCAGAG MAAOT^AC 1620 
lJ TTCACAAACT ATGAACCCCT TTCACTTGAT TAACGTTACC ATTTAGTCTC TTTTCAACTG 

f5 CAATGTTGCT TTTCCTGTAG ATGAACAAGT GAGAGATCAC ATTTAAATGA TGATCACTTT 1680 
id GTTACAACGA AAAGGACATC TACTTGTTCA CTCTCTAGTG TAAATTTACT ACTAGTGAAA 

CCATTTAATA CTTTCAGCAG TTTTAGTTAG ATGACATGTA GGATGCACCT AAATCTAAAT 1740 
GGTAAATTAT GAAAGTCGTC AAAATCAATC TACTGTACAT CCTACGTGGA TTTAGATTTA 

ATTTTATCAT AAATGAAGAG CTGGTTTAGA CTGTATGGTC ACTGTTGGGA AGGTAAATGC 1800 
TAAAATAGTA TTTACTTCTC GACCAAATCT GACATACCAG TGACAACCCT TCCATTTACG 

CTACTITGTC AATTCTGTTT TAAAAATTGC CTAAATAAAT ATTAAGTCCT AAATAAAAAA 1860 
SSSSSc TTAAGACAAA ATTTTTAACG GATTTATTTA TAATTCAGGA TTTATTTTTT 

AAAAAAAAAA AAAAA 
TTTTTTITTT TTTTT 



Fig. 4. (Continuation page 2, SEQ ID NO:4). 



MLLLFRAIPM LLLGLMVLQT DCEIAQYYID EEEPPGTVIA VLSQHSIFNT TDIPATNFRL 
MKQFNNSLIG VRESDGQLSI MERIDREQIC RQSLHCNLAL DWSFSKGHF KLLKVKVEVR 
DINDHSPHFP SEIMHVEVSE SSSVGTRIPL EIAIDEDVGS NSIQNFQISN MSHFSIDVLT 
RADGVKYADL VLMRELDREI QPTYIMELLA MDGGVPSLSG TAWNIRVLD FNDNSPVFER 
STIAVDLVED APLGYLLLEL HATDDDEGVN GEIVYGFSTL ASQEVRQLFK IHSRTGSVTL 
EGQVDFETKQ TYEFEVQAQD LGPNPLTATC KVTVHILDVN DNTPAITITP LTTVHAGVAY 
IPETATKEHF IALISTTDRA SGSHGQVRCT LYGHEHFKLQ QAYEDSYMIV TTSTLDRENI 
AAYSLTWAE DLGFPSLKTK KYYTVKVSDE NDHAPVFSKP QYEASILENN APGSYITTVI 
ARDSDSDQNG KVNYRLVDAK VMGQSLTTFV SLDADSGVLR AVRSLDYEKL KQLOFEIEAA 
DNGIPQLSTR VQLNLRIVDQ NDNCPVITNP LLNNGSGEVL LPISAPQNYL VFQLKAEOSD 
EGHNSQLFYT ILRDPSRLFA IHKESGEVFL KKQLNSDHSE DLSIWAVYD LGRPSLSTHA 
TVKFILTDSF PSNVEWILQ PSAEEQHQID MSIIFIAVLA GGCALLLLAI FFVACTCKKK 
AGEFKQVPEQ HGTCHEERLL SWSPQSV8S SLSQSESCQL SINTESENCS VSSNQEQHQQ 
TGIKHSISVP SYHTSGWHLD HCAMSISGHS HMGHISTKVQ WAKEIVTSMT VTLILVENQK 
RRALSSQCRH KPVLNTQMKQ QGSDMPITIS ATESTRVQKM GTAHCNMKRA IDCLTL 



Figure 5. Deduced amino acid sequence of Xenopus PAPC (paraxial protocadherin). 
PAPC encodes a member of the cadherin family of transmembrane proteins that has 
dorsalizing activity when constructs are injected into Xenopus embryos. SEQ ID NO:5. 



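Figure 6. Nucleotide sequence of Xenopus PAPC, with the sense strand on top and the antisense strand on the bottom line (opposite direction). SEQ ID NO:6. 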
CAATTCCCAG AGATGAACTC CTTGAGATTG TTTTAAATGA CTGCAGGTCT GGAAGGATTC 
Si^S TCTACTTGAG GAACTCTAAC AAAATTTACT GACGTCCAGA CCTTCCTAAG 

ACATTGCCAC ACTGTTTCTA GGCATGAAAA AACTGCAAGT TTCAACTTTG TTTTTGGTGC 
TGTAACGGTG TGACAAAGAT CCGTACTTTT TTGACGTTCA AAGTTGAAAC AAAAACCACG 

AACTTTGATT CTTCAAGATG CTGCTTCTCT TCAGAGCCAT TCCAATGCTG CTGTTGGGAC 
TTGAAACTAA GAAGTTCTAC GACGAAGAGA AGTCTCGGTA AGGTTACGAC GACAACCCTG 

TGATGGTTTT ACAAACAGAC TGTGAAATTG CCCAGTACTA CATAGATGAA G^GAACCCC 

ISIcIaIII tgtttgtctg acactttaac gggtcatgat gtatctactt CTTCTTGGGG 

CTGGCACTGT AATTGCAGTG TTGTCACAAC ACTCCATATT TAACACTACA GATATACCTG 
SSSSS TTAACGTCAC AACAGTGTTG TGAGGTATAA ATTGTGATGT CTATATGGAC 

" CAACCAATTT CCGTCTAATG AAGCAATTTA ATAATTCCCT TATCGGAGTC CGTGAGAGTG 360 
" S^SSIaa GGCAGATTAC TTCGTTAAAT TATTAAGGGA ATAGCCTCAG GCACTCTCAC 

S ATGGGCAGCT GAGCATCATG GAGAGGATTG ACCGGGAGCA AATCTGCAGG ^TCCCTTC 420 
W il^SicGA CTCGTAGTAC CTCTCCTAAC TGGCCCTCGT TTAGACGTCC GTCAGGGAAG 

ACTGCAACCT GGCTTTGGAT GTGGTCAGCT TTTCCAAAGG ACACTTCAAG CnOOMG 480 
TGACGTTGGA CCGAAACCTA CACCAGTCGA AAAGGTTTCC TGTGAAGTTC GAAGACTTGC 

TGAAAGTGGA GGTGAGAGAC ATTAATGACC ATAGCCCTCA CTTTCCCAGT GAAATAATGC 540 
ACTTTCACCT CCACTCTCTG TAATTACTGG TATCGGGAGT GAAAGGGTCA CTTTATTACG 

»ttt«-^GGT GTCTGAAAGT TCCTCTGTGG GCACCAGGAT TCCTTTAGAA ATTGCAATAG 

SSSI Iggagacacc CGTGGTCCTA aggaaatctt taacgttatc 






ATGAAGATGT TGGGTCCAAC TCCATCCAGA ACTTTCAGAT CTCAAATAAT AGCCACTTCA 
SffSSS ACCCAGGTTG AGGTAGGTCT TGAAAGTCTA GAGTTTATTA TCGGTGAAGT 



600 
660 



GCATTGATGT GCTAACCAGA GCAGATGGGG TGAAATATGC AGATTTAGTC TTAATGAGAG 
CGTAACTACA CGATTGGTCT CGTCTACCCC ACTTTATACG TCTAAATCAG AATTACTCTC 

»»<~rftftlPAG GGAAATCCAG CCAACATACA TAATGGAGCT ACTAGCAATG GATGGGGGTG 
C^SIS GGTTGTATGT ATTACCTCGA TGATCGTTAC CTACCCCCAC 

mctgGTACT GCAGTGGTTA ACATCCGAGT CCTGGACTTT AATGATAACA 
JlSSIiS ^CACCAAT TGTAGGCTCA GGACCTGAAA TTACTATTGT 

GCCCAGTGTT TGAGAGAAGC ACCATTGCTG TGGACCTAGT AGAGGATGCT CCTCTGGGAT 
SSSI IcTCTCTTCG TGGTAACGAC ACCTGGATCA TCTCCTACGA GGAGACCCTA 

ACCTTTTGTT GGAGTTACAT GCTACTGACG ATGATGAAGG AGTGAATGGA ^"GTTT 
SIaSI CCTCAATGTA CGATGACTGC tACTACTICC TCACTTACCT CTTTAACAAA 

ramieAe CACTTTGGCA TCTCAAGAGG TACGTCAGCT ATTTAAAATT AACfCCAGAA 1020 

%££E£ gtgaIIcIS agagttctcc atgcagtcga taaattttaa ttgaggtctt 
r~._r_rir.TGT TACTCTTGAA GGCCAAGTTG ATTTTGAGAC CAAGCAGACT TACGAATTTG 

SSI ISISS ccggttcaac taaaactctg gttcgtctga atgcttaaac 

iftTTACAAGC CCAAGATTTG GGCCCCAACC CACTGACTGC TACTTGTAAA GTAACTGTTC 

Stg^S S^SaIac ccggggttgg gtgactgacg atgaacattt cattgacaag 

ATATACTTGA TGTAAATGAT AATACCCCAG CCATCACTAT TACCCCTCTG ACTACTGTAA 
TATATGAACT acatttacta ttatggggtc GGTAGTGATA ATGGGGAGAC tgatgacatt 

ATGCAGGAGT TGCCTATATT CCAGAAACAG CCACAAAGGA GAACTTTATA GCTCTGATCA 
tacgtcctca acggatataa ggtctttgtc GGTGTTTCCI CTTGAAATAT CGAGACTAGT 

GCACTACTGA CAGAGCCTCT GGATCTAATG GACAAGTTCG CTGTACTCTT TATGGACATG 

Stgatgact gtctcggaga cctagattac ctgttcaagc gacatgagaa atacctgtac 

AGCACTTTAA ACTACAGCAA GCTTATGAGG ACAGTTACAT GATAGTTACC ACCTCTACTT 

TCGTGAAATT tgatgtcgtt cgaatactcc tgtcaatgta ctatcaatgg tggagatgaa 

_--. rsr v~ r , AAACATAGCA GCGTACTCTT TGACAGTAGT TGCAGAAGAC CTTGGCTTCC 
£££££ JiiSSSS CGCATGAGAA ACTGTCATCA ACGTCTTCTG GAACCGAAGG 

CCTCATTGAA GACCAAAAAG TACTACACAG TCAAGGTTAG TGATGAGAAT GACAATGCAC 
SScTT CTGGTTTTTC ATGATGTGTC AGTTCCAATC ACTACTCTTA CTGTTACGTG 

CTGTATTTTC TAAACCCCAG TATGAAGCTT CTATTCTGGA AAATAATGCT CCAGGCTCTT 
SSaag atttggggtc ATACTTCGAA gataagacct TITATTACGA GGTCCGAGAA 

ATATAACTAC AGTGATAGCC AGAGACTCTG ATAGTGATCA AAATGGCAAA GTAAATTACA 

JSattgatg TCACTATCGG tctctgagac tatcactagt tttaccgttt catttaatgt 

GACTTGTGGA TGCAAAAGTG ATGGGCCAGT CACTAACAAC ATTTGTTTCT CTTGA^ 
CTGAACACCT ACGTTTTCAC TACCCGGTCA GTGATTGTTG TAAACAAA6A GAACTACGCC 

ACTCTGGAGT ATTGAGAGCT GTTAGGTCTT TAGACTATGA AAAACTTAAA CAACTGGATT 

Saoctca taactctcga caatccagaa atctgatact TTTTGAATTT GTTGACCTAA 

TTGAAATTGA AGCTGCAGAC AATGGGATCC CTCAACTCTC CACTCGCGTT C^CTAAATC 
AACTTTAACT TCGACGTCTG TTACCCTAGG GAGTTGAGAG GTGAGCGCAA GTTGATTTAG 

TCAGAATAGT TGATCAAAAT GATAATTGCC CTGTGATAAC TAATCCTCTT CTT^^G 
.^t^i af^TACTTTTA CTATTAACGG GACACTATTG ATTAGGAGAA GAATTATTAC 
1620 

1680 

1140 
GCTCGGGTGA AGTTCTGCTT CCCATCAGCG CTCCTCAAAA CTATTTAGTT TTCCAGCTCA 
CGAGCCCACT TCAAGACGAA GGGTAGTCGC GAGGAGTTTT GAIAAATCAA AAGGTCGAGT 

AAGCCGAGGA ITCAGATGAA GGGCACAACT CCCAGCTGTT CTATACCATA CTGAGAGATC 
JicGGCTCCT AAGTCTACTT CCCGTGTTGA GGGTCGACAA GATATGGTAT GACTCTCTAG 

CAAGCAGATT GTTTGCCATT AACAAAGAAA GTGGTGAAGT GTTCCTGAAA AAACAATTAA 

caaacggtaa ttgtttcttt caccacttca caaggacttt tttgttaatt 

.rrrrrJLCCA TTCAGAGGAC TTGAGCATAG TAGTTGCAGT GTATGACTTG GGAAGACCTT 
SIS HS?S AACTCGTATC ATCAACGTCA CATACTGAAC CCTTCTGGAA 

CATTATCCAC CAATGCTACA GTTAAATTCA TCCTCACCGA CTCTTTTCCT ^AACGTTG 

Saataggtg gttacgatgt caatttaagt aggagtggct gagaaaagga agattgcaac 
Fig. 6. (Continuation page 2, SEQ ID NO:6). 
SIS SSSS ISSSS ss ssss ssss 
SSSSS S5S S5S ssss SSSS ssss 

GTACTTGTAA AAAGAAAGCT GGTGAATTTA AGCAGGTACC TGAACAACAC GGAACAT^A 

SSIIcatt tttctttcga ccacttaaat tcgtccatgg acttgttgtg ccttgtacgt 

ATGAAGAACG CCTGTTAAGC ACCCCATCTC CCCAGTCGGT CTCTTCTTCT "GTCTCAGT 

SSSS ggacaattcg tggggtagag gggtcagcca gagaagaaga aacagagtca 

CTGAGTCATG CCAACTCTCC ATCAATACTG AATCTGAGAA "GCAG^TG KCTCTAACC 
GACTCAGTAC GGTTGAGAGG TAGTTATGAC TTAGACTCTT AACGTCGCAC AGGAGATTGG 

UVCAGCAGCA TCAGCAAACA GGCATAAAGC ACTCCATCTC TGTACCATCT TATCACACAT 
SSSSS AGTCGTTTGT CCGTATTTCG TGAGGTAGAG ACATGGTAGA ATAGTGTGTA 

SSSS SSSSS SSSS SSSSS SSSSS SSSS 
SSSS SISSIS SSSS = SSSS SIS 
SSSS SSSSS SSSS SSSSS = SSSSS 
SSSSS SSSSS SSSSS 355S SSSS SSSS 
SSSS SSSSS SSSS SSS SSSS SSSSS 
SSSSS SSISSI SSSS SSSSS SSSSS SSSSS 

GAACCATACC CTTAGAGACC CTTATTACCA TATCAATAAT CCTGTT^TA ATCGGATGCA 
CTTGGTATGG GAATCTCTGG GAATAATGGT ATAGTTATTA G6ACAACGAT TAGCCTACGT 

GGCGGAATAT GAAAGAGATT TAGTCAACAG AAGTGCAACG TTATCTCCGC AGAGATCGTC 
^^Ita PTTTCTCTAA atcagttgtc TTCACGTTGC aatagaggcg tctctagcag 
SSSS SSSS SSSSS SSSS SSSS SSSSS 
SSSS SSSS SSSS SSSS SSSS SSSS 

SSSS SSSS ss ss ssss s= 

SSSS SSSS = SSSS ssss SSSSS 
SSSSS SSSSS SSSS SSSSS SSSS SSISS 
Fig. 6. (Continuation page 3, SEQ ID NO:6). 



ATTAAATCCA CAGACCTACA GTCAAATATT TGAGGGCCCC TGAAACAGCA CATCAGTCAG 

SStaggt GTCTGGATGT cagtttataa actcccgggg actttgtcgt gtagtcagtc 

™,™... rT GGCCTTTTTA CTTTTAGCAG CTCCTGGGTC TGCCCTCTGT GTTAATCAGC 

£££££ ££££ gIaaatcgtc gaggacccag acgggagaca caattagtcg 

CCCTGGTCAA GTCCTGAGTA GGATCATGGC GTTTTTATAT GCATCTCACC TACTTTGGAC 

SSgtt caggactcat cctagtaccg caaaaatata cgtagagtgg ATGAAACCTG 

GTGATTTACA CATAATAGGA AACGCTTGGT TTCAGTGAAG TCTGTGTTGT ATATATTCTG 

SSaUtgt gtattatcct ttgcgaacca aagtcacttc agacacaaca TATATAAGAC 

TTATATACAC GCATTTTGTG TTTGTGTATA TATTTCAAGT CCATTCAGAT ATCt^XTXT 

aItatatgtg cgtaaaacac aaacacatat ataaagttca ggtaagtcta tacacatata 

AGTGCAGACC TTGTAAATTA AATATTCTGA TACTTTTTCC TCAATAAATA TTTAAAT 
TCACGTCTGG AACATTTAAT TTATAAGACT ATGAAAAAGG AGTTATTTAT Juuun 
Fig. 6. (Continuation page 4, SEQ ID NO:6). 







MVCCGPGRML LGWAGLLVLA ALCLLQVPGA QAAACEPVRI PLCKSLPWHM TKMPNHLHHS 
TQANAILAME QFEGLLGTHC SPDLLFFLCA MYAPICTIDF QHEPIKPCKS VCERARQGCE 
PILIKVKHSW PESLACDELP VYDRGVCISP EAIVTADGAD FPMDSSTGHC RGASSERCKC 
KPVRATQKTY FRNNYNYVIR AKVKEVKMKC HDVTAVVEVK EILKASLVNI PRDTVNLYTT 
SGCLCPPLTV NEEYVIMGYE DEERSRLLLV EGSIAEKWKD RLGKKVKRWD MKLRHLGLGK 
TDASDSTQNQ KSGRNSNPRP ARS. 

Figure 7. Deduced amino acid sequence of mouse FRZB-1 protein. SEQ ID NO:7. 
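
A deduced amino acid sequence is obtained by reading the cDNA open reading frame codon by codon with the standard genetic code. The following is a minimal sketch of that step, assuming the reading frame starts at the first ATG and that the input contains only A/C/G/T. The names `CODON_TABLE` and `translate` are illustrative and not part of the source; the input is the start of the coding region as printed in the mouse FRZB-1 cDNA listing (Figure 8).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cdna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = cdna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First blocks of the Figure 8 sense strand, truncated to whole codons.
print(translate("AAGCCTGGGA CCATGGTCTG CTGCGGCCCG GGACGGATGC TGCTA"))
```

The printed residues can be compared against the opening residues of the Figure 7 listing; a disagreement points to a transcription error in either the protein or the nucleotide line.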
Figure 8. Nucleotide sequence of the full-length mouse FRZB-1 cDNA. SEQ ID NO:8. 

AAGCCTGGGA CCATGGTCTG CTGCGGCCCG GGACGGATGC TGCTAGGATG OeOOGGGTO 
TTCGGACCCT GGTACCAGAC GACGCCGGGC CCTGCCTACG ACGATCCTAC CCGGCCCAAC 

CTAGTCCTGG CTGCTCTCTG CCTGCTCCAG GTGCCCGGAG CTCAGGCTGC AGCCTGTGAG 
GATCAGGACC GACGAGAGAC GGACGAGGTC CACGGGCCTC GAGTCCGACG TCGGACACTC 

CCTCTCCGCA TCCCGCTGTG CAAGTCCCTT CCCTGGAACA TGACCAAGAT GCCCAACCAC 
GGACAGGCGT AGGGCGACAC GTTCAGGGAA GGGACCTTGT ACTGGTTCTA CGGGTTGGTG 

CTGCACCACA GCACCCAGGC TAACGCCATC CTGGCCATGG AACAGTTCGA AGGGCTGCTG 
GACGTGGTGT CGTGGGTCCG ATTGCGGTAG GACCGGTACC TTGTCAAGCT TCCCGACGAC 

GGCACCCACT GCAGCCCGGA TCTTCTCTTC TTCCTCTGTG CAATGTACGC ACCCATTTGC 

SSgtga cgtcgggcct agaagagaag aaggagacac gttacatgcg tgggtaaacg 

arr»TPQACT TCCAGCACGA GCCCATCAAG CCCTGCAAGT CTGTGTGTGA GCGCGCCCGA 

□ SJSSS cIIgtagttc gggacgttca gacacacact cgcgcgggct 


3 PARGGCTGCG AGCCCATTCT CATCAAGTAC CGCCACTCGT GGCCGGAAAG CTTGGCCTGC 

SScISc TCGGGTAAGA GTAGTTCATG GCGGTGAGCA CCGGCCTTTC GAACCGGACG 

GACGAGCTGC CGGTGTACGA CCGCGGCGTG TGCATCTCTC CTGAGGCCAT CGTCACCGCG 

S^Sg gSacatgct ggcgccgcac acgtagagag gactccggta gcagtcgcgc 

GACGGAGCGG ATTTTCCTAT GGATTCAAGT ACTGGACACT GCAGAGGGGC AAGCAGCGAA 
CTGCCTCGCC TAAAAGGATA CCTAAGTTCA TGACCTGTGA cgtctccccg TTCGTCGCTT 

CGTTGCAAAT GTAAGCCTGT CAGAGCTACA CAGAAGACCT ATTTCCGGAA CAATTACAAC 
CATTCGGACA GTCTCGATGT GTCTTCTGGA TAAAGGCCTT GTTAATGTTG 

TATGTCATCC GGGCTAAAGT TAAAGAGGTA AAGATGAAAT GTCATGATGT JACCG^ 
SftCAGTAOO CCCCATTTCft imTKCM TTCTACTTTA CAGTACTACA CTGGCGGCAA 

GTGGAAGTGA AGGAAATTCT AAAGGCATCA CTGGTAAACA TTCCAAGGGA CACCGTCAAT 
CACCTTCACT TCCTTTAAGA TTTCCGTAGT GACCATTTGT AAGGTTCCCT GTGGCAGTTA 
AATCCCCGGC CAGCACGCAG CTAAATCCTG AAATGTAAAA GGCCACACCC ACGGACTCCC 
^ISgccg gtcgtgcgtc GATTTAGGAC tttacatttt CCGGTGTGGG TGCCTGAGGG 

TTCTAAGACT GGCGCTGGTG GACTAACAAA GGAAAACCGC ACAGTTGTGC TCGTGACCGA 

Ugattctga ccgcgaccac ctgattgttt ccttttggcg tgtcaacacg AGCACTGGCT 

TTGTTTACCG CAGACACCGC GTGGCTACCG AAGTTACTTC CGGTCCCCTT TCTCCTGCTT 

A^HIJalc gtctgtggcg caccgatggc ttcaatgaag GCCAGGGGAA agaggacgaa 

CTTAATGGCG TGGGGTTAGA TCCTTTAATA TGTTATATAT TCTGTTTCAT CAATCACGTG 
SIwUcGC ACCCCAATCT AGGAAATTAT ACAATATATA AGACAAAGTA GTTAGTGCAC 

GGGACTGTTC TTTTGCAACC AGAATAGTAA ATTAAATATG TTGATGCTAA GGTTTCTGTA 
SIcAAG AAAACGTTGG TCTTATCATT TAATTTATAC AACTACGATT CCAAAGACAT 

r-TGGACTCCC TGGGTTTAAT TTGGTGTTCT GTACCCTGAT TGAGAATGCA ATGTTTCATG 

SESSS ScIaHS AACCACAAGA CATGGGACTA actcttacgt tacaaagtac 

TAAAGAGAGA ATCCTGGTCA TATCTCAAGA ACTAGATATT GCTGTAAGAC AGCCTCTGCT 
ATTTCTCTCT TAGGACCAGT ATAGAGTTCT TGATCTATAA CGACATTCTG TCGGAGACGA 

GCTGCGCTTA TAGTCTTGTG TTTGTATGCC TTTGTCCATT TCCCTCATGC TGTGAAAGTT 
ATCAGAACAC AAACATACGG AAACAGGTAA AGGGAGTACG ACACTTTCAA 

ATACATGTTT ATAAAGGTAG AACGGCATTT TGAAATCAGA CACTGCACAA GCAGAGTAGC 

SSaaa tatttccatc ttgccgtaaa actttagtct GTGACGTGTT cgtctcatcg 

CCAACACCAG GAAGCATTTA TGAGGAAACG CCACACAGCA TGACTTATTT TCAAGATTGG 
£^GCTC CTTCGTAAAT ACTCCTTTGC GGTGTGTCGT ACTGAATAAA AGTTCTAACC 
CAGGCAGCAA AATAAATAGT GTTGGGAGCC ^AAAAGAA TATTTTGCCT 
GTCCGTCGTT TTATTTATCA CAACCCTCGG TTCTTTTCTT ATAAAACGGA CCAATTCCCC 

ririrTWAA TCAGTAGCCC TTGAGCCATT AACAGCAGTG TTCTTCTGGC AAGTTTTTGA 

ISSiSS aactcggtaa ttgtcgtcac aagaagaccg ttcaaaaact 

TTTGTTCATA AATGTATTCA CGAGCATTAG AGATGAACTT ATAACTAGAC ATCTGTTGTT 

HISaSS £aStaagt gctcgtaatc tctacttgaa tattgatctg tagacaacaa 

ATCTCTATAG CTCTGCTTCC TTCTAAATCA AACCCATTGT TGGATGCTCC CTCTCCATTC 
TaSI£ GAGACGAAGG AAGATTTAGT TTGGGTAACA ACCTACGAGG GAGAGGTAAG 
ATAAATAAAT TTGGCTTGCT GTATTGGCCA GGAAAAGAAA GTATTAAAGT ATGCATGCAT 1860 
iSSUEI AACCGAACGA CATAACCGGT CCTTTTCTTT CATAATTTCA TACGTACGTA 

GTGCACCAGG GTGTTATTTA ACAGAGGTAT GTAACTCTAT AAAAGACTAT AATTTACAGG 
CACGTGGTCC CACAATAAAT TGTCTCCATA CATTGAGATA TTTTCTGATA TTAAATGTCC 

ACACGGAAAT GTGCACATTT GTTTACTTTT TTTCTTCCTT TTGCTTTGG^ 
TGTGCCTTTA CACGTGTAAA CAAATGAAAA AAAGAAGGAA AACGAAACCC GAACACTAAA 

TGGTTTTTGG TGTGTTTATG TCTGTATTTT GGGGGGTGGG TAGGTTTAAG CCATTGCACA 

SaHHI? SSSac agacataaaa ccccccaccc atccaaattc ggtaacgtgt 

TTCAAGTTGA ACTAGATTAG AGTAGACTAG GCTCATTGGC CTAGACATTA TM^ 
AAGTTCAACT TGATCTAATC TCATCTGATC CGAGTAACCG GATCTGTAAT ACTAAACTTA 

TTGTGTTGTT TAATGCTCCA TCAAGATGTC TAATAAAAGG AATATGGTTG TCAACAGAGA 
AACACAACAA ATTACGAGGT AGTTCTACAG ATTATTTTCC TTATACCAAC AGTTGTCTCT 



MVCGSPGGML LLRAGLLALA ALCLLRVPGA RAAACEPVRI PLCKSLPWNM TKMPNHLHHS 
TQANAILAIE QFEGLLGTHC SPDLLFFLCA MYAPICTIDF QHEPIKPCKS VCERARQGCE 
PILIKYKHSW PENLACEELP VYDRGVCISP EAIVTADGAD FPMDSSNGNC RGASSERCKC 
KPIRATQKTY FRNNYNYVIR AKVKEIKTKC HDVTAVVEVK EILKSSLVNI PRDTVNLYTS 
SGCLCPPLNV NEEYIIMGYE DEERSRLLLV EGSIAEKWKD RLGKKVKRWD MKLRHLGLSK 
SDSSNSDSTQ SQKSGRNSNP RQARN. 

Figure 9. Deduced amino acid sequence of human FRZB-1 protein. SEQ ID NO:9. 
Figure 10. Nucleotide sequence of the full-length human FRZB-1 cDNA. SEQ ID NO:10. 
This sequence was assembled from public ESTs from the database 
(accession numbers: H18848, R63748, W38677, W44760, H38379 and N71244). 

GGCGGAGCGG GCC^mGGC GTCCACTGCG CGGCTGCACC CTOCCCCATC J***^ 
CCGCCTCGCC CGGAAAACCG CAGGTGACGC GCCGACGTGG GACGGGGTAG ACGGCCCTAG 

ATGGTCTGCG GCAGCCCGGG AGGGATGCTG CTGCTGCGGG CCGGGCTGCT TG^TGGCT 
TACCAGACGC CGTCGGGCCC TCCCTACGAC GACGACGCCC GGCCCGACGA ACGGGACCGA 

GCTCTCTGCC TGCTCCGGGT GCCCGGGGCT CGGGCTGCAG CCTGTGAGCC CGTCCGCATC 

cgagSS Icgaggccca cgggccccga gcccgacgtc ggacactcgg gcaggcgtag 

CCCCTGTGCA AGTCCCTCCC CTGGAACATG ACTAAGATGC CCAACCACCT GCACCACAGC 
SIScli TCAGGGACGG GACCTTGTAC TGATTCTACG GGTTGGTGGA CGTGGTGTCG 

ACTCAGGCCA ACGCCATCCT GGCCATCGAG CAGTTCGAAG GTCTGCTGGG CACCCACTGC 
£a1S£ TGCGGTAGGA CCGGTAGCTC GTCAAGCTTC CAGACGACCC GTGGGTGACG 

£25 EES =55 = =55 === 

™ S5£ SEES =35 = 55=5 

s= 5= =5 s= ss ™ 
-55= = =5= = =s =s 
i iaaa asss sssss esse ssss 
™ sss= =s= =5= =2 =s= 

SSSS =5= 5=35 5= =5= S5SS 
S555! =1 =5= 55555 =5= =5= 
— 555= =5= =5= 5= = 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



GATGAGGAAC GTTCCAGATT ACTCTTGGTG GAAGGCTCTA TAGCTGAGAA GTGGAAGGAT 900 
CTACTCCTTG CAAGGTCTAA TGAGAACCAC CTTCCGAGAT ATCGACTCTT CACCTTCCTA 

CGACTCGGTA AAAAAGTTAA GCGCTGGGAT ATGAAGCTTC GTCATCTTGG ACTCAGTAAA 960 
GCTGAGCCAT TTTTTCAATT CGCGACCCTA TACTTCGAAG CAGTAGAACC TGAGTCATTT 

AGTGATTCTA GCAATAGTGA TTCCACTCAG AGTCAGAAGT CTGGCAGGAA CTCGAACCCC 1020 
TCACTAAGAT CGTTATCACT AAGGTGAGTC TCAGTCTTCA GACCGTCCTT GAGCTTGGGG 

CGGCAAGCAC GCAACTAAAT CCCGAAATAC AAAAAGTAAC ACAGTGGACT TCCTATTAAG 1080 
GCCGTTCGTG CGTTGATTTA GGGCTTTATG TTTTTCATTG TGTCACCTGA AGGATAATTC 

ACTTACTTGC ATTGCTGGAC TAGCAAAGGA AAATTGCACT ATTGCACATC ATATTCTATT 1140 
TGAATGAACG TAACGACCTG ATCGTTTCCT TTTAACGTGA TAACGTGTAG TATAAGATAA 

GTTTACTATA AAAATCATGT GATAACTGAT TATTACTTCT GTTTCTCTTT TGGTTTCTGC 1200 
CAAATGATAT TTTTAGTACA CTATTGACTA ATAATGAAGA CAAAGAGAAA ACCAAAGACG 

TTCTCTCTTC TCTCAACCCC TTTGTAATGG TTTGGGGGCA GACTCTTAAG TATATTGTGA 1260 
H AAGAGAGAAG AGAGTTGGGG AAACATTACC AAACCCCCGT CTGAGAATTC ATATAACACT 

O GTTTTCTATT TCACTAATCA TGAGAAAAAC TGTTCTTTTG CAATAATAAT AAATTAAACA 1320 

□ CAAAAGATAA AGTGATTAGT ACTCTTTTTG ACAAGAAAAC GTTATTATTA TTTAATTTGT 

M TGCTGTTACC AGAGCCTCTT TGCTGAGTCT CCAGATGTTA ATTTACTTTC TGCACCCCAA 1380 

fj ACGACAATGG TCTCGGAGAA ACGACTCAGA GGTCTACAAT TAAATGAAAG ACGTGGGGTT 

T TTGGGAATGC AATATTGGAT GAAAAGAGAG GTTTCTGGTA TTCACAGAAA GCTAGATATG 1440 

□ AACCCTTACG TTATAACCTA CTTTTCTCTC CAAAGACCAT AAGTGTCTTT CGATCTATAC 

CCTTAAAACA TACTCTGCCG ATCTAATTAC AGCCTTATTT TTGTATGCCT TTTGGGCATT 1500 
fT GGAATTTTGT ATGAGACGGC TAGATTAATG TCGGAATAAA AACATACGGA AAACCCGTAA 

- CTCCTCATGC TTAGAAAGTT CCAAATGTTT ATAAAGGTAA AATGGCAGTT TGAAGTCAAA 1560 
— ** GAGGAGTACG AA'l'LWl'CAA OJ ' ITJL ' ACAAA TAX T TCCATT TTACGGTGAA ACCTCACTTT 

TGTCACATAG GCAAAGCAAT CAAGCACCAG GAAGTGTTTA TGAGGAAACA ACACCCAAGA 1620 
ACAGTGTATC CGTTTCGTTA GTTCGTGGTC CTTCACAAAT ACTCCTTTGT TGTGGGTTCT 

TGAATTATTT TTGAGACTGT CAGGAAGTAA AATAAATAGG AGCTTAAGAA AGAACATTTT 1680 
ACTTAATAAA AACTCTGACA GTCCTTCATT TTATTTATCC TCGAATTCTT TCTTGTAAAA 

GCCTGATTGA GAAGCACAAC TGAAACCAGT AGCCGCTGGG GTGTTAATGG TAGCATTCTT 1740 
CGGACTAACT CTTCGTGTTG ACTTTGGTCA TCGGCGACCC CACAATTACC ATCGTAAGAA 

CTTTTGGCAA TACATTTGAT TTGTTCATGA ATATATTAAT CAGCATTAGA GAAATGAATT 1800 
GAAAACCGTT ATGTAAACTA AACAAGTACT TATATAATTA GTCGTAATCT CTTTACTTAA 

ATAACTAGAC ATCTGCTGTT ATCACCATAG TTTTGTTTAA TTTGCTTCCT TTTAAATAAA 1860 
TATTGATCTG TAGACGACAA TAGTGGTATC AAAACAAATT AAACGAAGGA AAATTTATTT 



CCCATTGGTG AAAGTCAAAA AAAAAAAAAA AAA 